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Abstract. We investigate abelian repetitions in Sturmian words. We exploit a 
bijection between factors of Sturmian words and subintervals of the unitary seg- 
ment that allows us to study the periods of abelian repetitions by using classical 
results of elementary Number Theory. We prove that in any Sturmian word the 
ratio between the maximal exponent of an abelian repetition of period m and 
m converges to a number greater than or equal to y/5, and the equality holds 
for the Fibonacci infinite word. We further prove that the longest prefix of the 
Fibonacci infinite word that is an abelian repetition of period Fj, j > 1, has 
length Fj(F j+1 + F^-i + 1) - 2 if j is even or Fj(F j+1 + Fj^i) - 2 if j is odd. 
This allows us to give an exact formula for the smallest abelian periods of the 
Fibonacci finite words. More precisely, we prove that for j > 3, the Fibonacci 
word fj has abelian period equal to F n , where n = /2J if j = 0, 1, 2 mod 4, or 
n = 1 + b'/2j if j = 3 mod 4. 

Keywords: Formal Languages, Combinatorics on Words, Stringology, abelian repeti- 
tions, Sturmian words, Fibonacci words. 

1 Introduction 

The study of repetitions in words is a classical subject in Theoretical Computer Science 
both from the combinatorial and the algorithmic point of view. Repetitions are strictly 
related to the notion of periodicity. Recall that a word w of length \w\ has a period p > 
if w[i] = w[i+p] for any 1 ^ i ^ |u;| — p, where w[i] is the symbol in position i of w. Every 
word w has a minimal period p < \w\. If > 1, w is called a repetition of period 

p and exponent When |u>|/p = k is an integer, the word w is called an integer 

power, since it can be written as w = u k , that is, w is the concatenation of k copies of 
a word u of length p. If instead \w\/p is not an integer, the word w is called a fractional 
power. So one can write w = u k v, where v is the prefix of u such that \w\/p = k+ \v\/\u\. 
For example, the word w = aabaaba is a 7/3-power since it has minimal period 3 and 
length 7. A classical reference on periodicity is [29j Chap. 7]. 

Abelian properties concerning words have been studied since the very beginning 
of Formal Languages and Combinatorics on Words. The notion of Parikh vector has 
become a standard and it is often used without an explicit reference to the 1966 Parikh 
paper [35] , that is however cited more than 450 times. On the other hand abelian powers 
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were first considered in 1961 by Erdos 20 as a natural generalization of usual powers. 
Researches concerning abelian properties of words and languages never stopped and they 
developed in different directions (see for instance [§[^[^[^[2^[2§|^|t||39l|40||4l]). 
There is a recent increasing of interest on abelian properties linked to periodicity (see 
(^§|§0[^[^[^[19||23||39||^||42] and references therein). 

Recall that the Parikh vector V w of a finite word w enumerates the cardinality of 
each letter of the alphabet in w. For example, given the alphabet E — {a, b, c}, the 
Parikh vector of the word w = aaba is V w = (3, 1, 0). Roughly speaking, two words have 
the same Parikh vector if one can be obtained from the other by permuting letters. 

We say that the word w is an abelian repetition of (abelian) period m and exponent 
\w\/m if w can be written as w — uqU\ ■ ■ ■ Uk-iUk for words Ui and an integer k > 2, 
where for < i < k all the Mi's have the same Parikh vector whose sum of components 
is m and the Parikh vectors of uo and Uk are contained in V (s ee [13] ). When uq and 
Uk are empty, w is called an abelian power or weak repetition [15] . For example, the 
word w = abaab is an abelian repetition of period 2, since one can set Uq = a, U\ = ba, 
u 2 = ah and = e, where e denotes the empty word. 

It is well known that Sturmian words and Fibonacci words, in particular, are extremal 
cases for several problems related to repetitions (see for example 14 24 34]) and are 
worst-case examples for classical pattern matching algorithms, e.g. Knuth-Morris-Pratt 
[T 28 . There exists a huge bibliography concerning Sturmian words (see for instance the 



survey papers [4][5 27 , 29, Chap. 3], |38, Chap. 6] and references therein). In particular, 



there is an analogous result to the one presented in this paper concerning classical 
repetitions in the Fibonacci infinite word 33 . This result was subsequently extended to 



the class of Sturmian words in several other papers. In 31 , a bijcction between factors 



of Sturmian words and subintervals of the unitary segment is described. In this paper 
we show that this bijection preserves abelian properties of factors (see Proposition [3]). 
Therefore we are able to apply techniques of Number Theory coupled with Combinatorics 
on Words to obtain our main results. More precisely, we prove that in any Sturmian 
word the ratio between the maximal exponent of an abelian repetition of period m and 
m converges to a number greater than or equal to y/5, and the equality holds for the 
Fibonacci infinite word. We further prove that the longest prefix of the Fibonacci infinite 
word that is an abelian repetition of period Fj, j > 1, has length Fj{Fj + \ + 1) — 2 

if j is even or Fj(Fj + i + Fj-i) — 2 if j is odd. This allows us to give an exact formula 
for the smallest abelian periods of the Fibonacci finite words. More precisely, we prove 
that for j > 3, the Fibonacci word fj has abelian period equal to F n , where n = [j/2\ 
if j = 0, 1, 2 mod 4, or n = 1 + [j/2\ if j = 3 mod 4. 



2 Preliminaries 

Let E = {ai, 0,2, ■ ■ ■ , a CT } be a finite ordered alphabet of cardinality a and S* the set 
of words over S. We denote by \w\ the length of the word w. We write w[i] the i-th 
symbol of w and w[i. .j] the factor of w from the t-th symbol to the j-th symbol, with 
1 ^ i ^ j ^ |io|. We denote by |iy| a the number of occurrences of the symbol a G E in 
the word w. 

The Parikh vector of a word w, denoted by V w , counts the occurrences of each letter 
of E in w, that is, V w — (|^j 0l , . . . , |w| 0<r )- Given the Parikh vector V w of a word w, we 
denote by V w [i] its i-th component and by \V W \ the sum of its components. Thus, for 
a word w and 1 ^ i ^ a, we have V w [i) = \w\ a( and \V W \ = X^i^H — M- Finally, 
given two Parikh vectors V, Q, we write V C Q if V[i] ^ Q[i] for every 1 sC i ^ a and 
W\<\Ql 
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Following 13 , we give the definition below. 



Definition 1. A word w is an abelian repetition of period m > and exponent \w\/m if 
one can write w = uqU\ ■ ■ ■ Uk-iUk for some k > 2 such that V Ua C V Ul = ■ ■ ■ = V Uk _ 1 D 
V Uk , and \V Ul \ = . . . = |'P lt(f _ 1 | = m. An abelian power is an abelian repetition in which 
u = u k = e. 

We call uq and u k respectively the head and the tail of the abelian repetition. 
Notice that the length t = \u k \ of the tail is uniquely determined by h, m and \w\, 
namely t = (\w\ — h) mod m. 

In the rest of the paper, when we refer to an abelian repetition of period m, we 
always suppose that m is the minimal abelian period of w. 

Example 1. The word w — abaababa is an abelian repetition of period 2, since one can 
write w = a ■ ba ■ ab ■ ab ■ a. Notice that w is also an abelian repetition of period 3, since 
w = aba ■ aba ■ ba. 



2.1 Sturmian words 

Here and in the rest of the paper, we fix the alphabet £ = {a,b}. We start by recalling 
a bijection between factors of Sturmian words and subintervals of the unitary segment 



introduced in 31 



Let a and p be two real numbers with a € (0, 1). Following the notations of 22 



a n (ot,p) = 



the fractional part of a number r is defined by {r} = r — [r\, where [?~J is the greatest 
integer smaller than or equal to r. Therefore, for a <G (0, 1), one has that {—a} = 1 — a. 

The sequence {na + p}, n > 0, defines an infinite word s a . p = a\(a, p)a2(a, p) ■ ■ ■ by 
the rule 

bif {na + p} € [0, {-a}), 
a if {na + p} e [{-a}, 1). 

See Fig. [l] for a graphical illustration. 

We shall write a n instead of a n (a,p) whenever there is no possibility of mistake. If 
a = p/q, with p and q coprime integers, then it is easy to prove that the word s Q p is 
periodic and q is its minimal period. If instead a is irrational, then s atP is not periodic 
and is called a Sturmian word. Therefore, in the rest of the paper, we always suppose a 
irrational. 

Example 2. For a = <j) — 1 and p = 0, where <f> — (1 + Vo)/2 is the golden ratio, one 
obtains the Fibonacci infinite word 

/ = s a.o = abaababaabaababaababa 

Remark 1. Since a G (0, 1), we have {— ia} { — (i + l)a} for any natural number i. We 
shall use this fact freely and with no explicit mention. 



It is possible to prove (see 31 Corollary 2.3]) that the following proposition holds. 

Proposition 1. Let a and p be real numbers, with a G (0, 1) irrational. For any natural 
numbers n, i, with n > 0, if {— (i + l)a} < {—ia} then 

a n+i =a {na + p} E [{-(i + l)a},{-ia}), 

whereas if {—ia} < { — (i + l)a}) then 

a n+i = a <^ {na + p} e [0, {-ia}) U [{-(i + l)a}, 1). 
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Fig. 1. An application of Proposition [T] when a — (f> — 1 ~ 0.618 (thus {—a} ~ 0.382) for i = 0. 
If {na + p} G [{—a}, 1), then a n = a; otherwise a n — b. 
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Fig. 2. An application of Proposition [T] when a — 4> — 1 ~ 0.618 (thus {—a} ~ 0.382) for i = 1. 
If {na + p} G [0, {—a}) U [{—2a}, 1), then a n +i = a; otherwise a n +i = b. 
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Fig. 3. A single graphic representation of the information given in Fig. [T] and [5] If {na + p] G 
[0, {-a}) = L (a, 2), then a n = b, a n+i = a. If {na + p} £ [{-a}, {-2a}) = Li(a, 2), then 
a n = a, a n+1 = b. If {na + p] G [{—2a}, 1) = L 2 (a,2), then a n = a, a n+ i = a. 



In Fig. [T] and [2] wc display a graphical representation of the formula given in Propo- 
sition [l] for a = (/> — 1 when i = and i = 1, respectively. In Fig. [3] we present within a 
single graphic the situations illustrated in Fig. [T] and [2j 

Let m be a positive integer. Consider the m + 1 points 0, 1, {— ia}, for < i < m. 
Rearranging these points in increasing order one has: 

= c (a, m) < ci(a, m) < ■ ■ ■ < Cfc(a, to) < • • • < c m (a, m) < c m+ i(a, m) = 1. 

One can also define the non-empty intervals Lk(a, m) = [ck(a, to), Ck+i(ot, to)), < k < 

TO. 

By using Proposition [l] it is possible to associate with each interval i^(a, to) a factor 



of length to of the word s Q p , and this correspondence is bijective (see 32 ). 



Proposition 2. Each factor of s a . p of length to, a„a„+i . . . a n _|_ m _i, depends only on 
the interval Lk(a,m) containing the point {na + p}; more precisely, it depends only on 
the set /fe(a, to) of integers i € {0, 1, ... , m— 1} swcft f/ia£ either {— (i+l)a} < {— iev} and 
Cfc(a,m) G [{ — (j + l)o/f, {— «a}) or {— (i+l)a} > {—ia} and c^(a,m) £ [{— ia},{— (i + 
l)a}). The set Ik{a,m) is the set of the integers i, < i < m — 1, such that a n+ i = a. 

Corollary 1. Since the set of factors o/s Q . p depends only on the sequence {—ia}, i > 0, 
it does not depend on p. 
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Definition 2. We call the bijective correspondence in Proposition^ the Sturmian bi- 
jcction. 

Example 3. Let a — 4> — 1. In Fig.[3]we show an example of the Sturmian bijection when 
to = 2. The ordered sequence of points defining the intervals L^{a, 2) is 

Co (a, 2) = 0, a(a, 2) = {-a} w 0.382, c 2 (a, 2) = {-2a} « 0.764, c 3 (a, 2) = f . 

In Fig. [4] we show an example of the Sturmian bijection when to = 6. Below each 
interval there is the factor of s a of length to = 6 associated with that interval. The prefix 
of length 6 of the Fibonacci word corresponds to the factor below the interval containing 
a (so, for n = 1 and p = 0). Notice that all the factors of length 6 of the Fibonacci 
word appear, and moreover they appear in lexicographic order from right to left. This 
property concerning lexicographic order holds for any Sturmian word and any length to 
of factors and it is proved in next proposition. 

Next proposition is not strictly linked to main results of this paper, but it is of 
independent interest and it is related to recent researches on Sturmian words (see [8{ |21| 
[25j[||). 

Proposition 3. For any m > 1 we have that k < k' if and only if the factor t a , p , m as- 
sociated to Lk(a, to) in the Sturmian bijection is greater than the factor t' a m associated 
to Lk'(a,m), following the lexicographic order where a < b. 

Proof. The proof is by induction on to. The base of the induction is true by very def- 
inition of the sequence of the two intervals Li (a, 1), i — 0, 1 and the case a = (j> — 1, 
is showed in Figure [T] Suppose that the property holds for to and let us prove it for 
to + 1. The sequence of intervals L{(a, m), i = 0, . . . , to is the same sequence of intervals 
Lj(a, to + 1), i = 0, . . . , to + 1 except for the interval Lt(a, to) containing {—(to + l)a} 
that is subdivided into two intervals in the second sequence. Therefore the prefixes of 
length to of the two factors £ Q ,p,m+i and t' a p m+1 are equal if and only if k 1 = k + 1 
and Lf,(a,m) = Lk{a,m + I) U L' k (a,m + 1). In this case the last letter of i Q , P: m+i is 
b and the last letter of t' a m+1 is a and the statement of this proposition is verified. 
Otherwise we use the induction on the prefix of length to of both factors. □ 

In this paper, we present a new semantic property of the Sturmian bijection, which 
will be described in the next section, that will allow us to use some Number Theory 
techniques to deal with abelian repetitions in Sturmian words and, in particular, in the 



Fibonacci word. Similar techniques were used in 40 to derive nice results on abelian 



powers on Sturmian words, that are repetitions with empty head and tail. 



3 On the Sturmian bijection and Parikh vectors 

Let Sq, )P be a Sturmian word. Since we are mainly interested in the set of factors of s a ,p; 
we do not lose generality, by Corollary [l] supposing p — 0. For simplicity of notation, 
we will therefore write s a instead of s Q! o- 

In this section, we describe some properties of the Sturmian bijection between the 
factors of length to of s a and the intervals Lk(a, to) that we will use in the rest of the 
paper. 

Proposition 4. Under the Sturmian bijection, all the factors corresponding to an in- 
terval Cfc(a, to) = [x, y) with x > {—ma} have the same Parikh vector vi(a,m) and all 
the factors corresponding to an interval [x,y) with y < {—ma} have the same Parikh 
vector V2(a,m). Moreover, one has Vi(a, m)[l] = i>2(a,TO,)[l] + 1. 
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Fig. 4. Subintervals Lh{a,m) obtained for a = <f> — 1 and m = 6. Below each interval there 
is the factor of s a of length 6 associated to that interval. For p = and n = 1, the prefix of 
length 6 of the Fibonacci word is associated to Li{a, 6) = [04(0, 6), cs(a, 6)), that is the interval 
containing a. 

Proof. We prove the statement by induction on m. The case m = 1 is true by definition. 
Suppose the statement true for m and let us prove it for m + 1. We have two cases: 

Case 1) { — (m + l)a} < {—ma}; 

Case 2) {—ma} < {— (m + l)a}. 

We give the proof for the first case since the proof for the second case is analogous. 
By Proposition 2, using the same notation, if Cfc(a,m + 1) 6 [{ — (m + l)a}, {— ma}) 
then a n+m = a otherwise a„+ m = b. If Cfe(a, m + 1) G [{— (m + l)a}, {—ma}) we also 
have that Lfc(a, m + 1) = [cfc(a, m + 1), Ck+i(ot, m + 1)) = [cfe_i(a, m), Cfe(a, m)) except 
in the case Cfc(a, m+1) — {— (m+ l)a} where Lk(a, m + 1) = [{— (m + l)a}, Cfc(a, m)) C 
[ c k-i{oL, m), Cfe(a, m)). This is because in order to obtain the sequence Cfe(a,m + 1) we 
add the point { — (m + l)a} to the sequence Cfc(a, m), and, consequently, the index of 
all points in the original sequence greater than { — (m + l)a} is increased by one in 
the newly obtained sequence. Therefore, the Parikh vector of the factors corresponding 
to the intervals Lfc(a,m + 1), where Cfc(a, m + 1) G [{ — (m + l)a},{— ma}), is equal 
to t>2(a,m) + (1,0), since a m+ i = a. For the other intervals, since a m+ i = b, the 
Parikh vector of the factors corresponding to the intervals is equal to i>2(a, m) + (0, 1) if 
Cfc(a, m + 1) < {— (m + l)a} < {—ma} or to «i(a, m) + (0, 1) if Cfe(a, m+1) > {—ma}. 

Putting all together, we have that t>2(a, m+1) = t>2(a, m) + (0, 1) while v\(a, m+1) = 
V2{a, m) + (1, 0) = i>i(a,m) + (0, 1), and, consequently, i>i(a,m)[l] = W2(a,m)[l] + 1. □ 

The reader can see in Fig. [4] that the factors of length 6 corresponding to an interval 
to the left of {—6(0 — 1)} have Parikh vector (3, 3), while the others have Parikh vector 
(4,2). 

We now address the following questions: 

1. Given m, how long can be the longest abelian power of period m in s a ? 

2. Given m, how long can be the longest abelian power of period m that is a prefix of 

s 1 

3. What can we say in the particular case of the Fibonacci word? 

We start with a corollary, whose proof comes straightforwardly from Proposition |4j 

Corollary 2. Let w be an abelian power of period m and exponent k + 1 appear- 
ing in s a in position n. Then all the points in the sequence {na},{(n + m)a},{(n + 
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2m)a}, . . . , {(n + km)a} are in the same subinterval in which [0, 1) is subdivided by the 
point {—ma}, that is, either [0, {—ma}) or [{— ma},l). 

Proposition 5. If k > 1, the k + 1 points of Corollary^ are naturally ordered. That 
is, if {ma} < 0.5, then they are all in the subinterval [0, {—ma}) and one has {na} < 
{(n + m)a} < ■ ■ • < {(n + km)a}; if {ma} > 0.5 then they are all in the interval 
[{—ma}, 1) and one has {(n + km)a} < {(n + (k — l)m)a} < • • ■ < {na}. 

Theorem 1. Let m be a positive integer such that {ma} < 0.5 (resp. {ma} > 0.5). 
Then: 

1. In s a there is an abelian power of period m and exponent k > 2 if and only if 
{ma} < jr (resp. {—ma} < i). 

2. If in s a there is an abelian power of period m and exponent k > 2 starting in position 
i with {ia} > {ma} (resp. {ia} < {ma}), then {ma} < -j^j (resp. {—ma} < jrj^r 
). Conversely, if {ma} < ^rj-j- (resp. {—ma} < then there is an abelian power 
of period m and exponent k > 2 starting in position m. 

Proof. (Sketch) Let m be a positive integer such that {ma} < 0.5. If in s a there is an 
abelian power of period m and exponent k > 2, then by previous proposition, all the k 
points of Corollary [2] are naturally ordered and they must lie in the interval [0, {—ma}). 
The distance between any two consecutive such points is {ma}. Therefore, the size of 
this interval, that is equal to 1 — {ma}, must be smaller than (fc — l){ma}. From this 
relation, one derives that {ma} < 1/k. 

Conversely, suppose that {ma} < l/k. Since a is irrational, the sequence {ia} is 
dense in [0,1) (see (22l Chap. XXIII]). Therefore, one can find a number i close to 
such that the points of Corollary [2] all lie in the interval [0, {—ma}). The thesis follows 
by Proposition |4] The case {ma} > 0.5 can be proved in an analogous way. 

Also the second part of the theorem is dealt in a similar way. The only difference is 
that the first point of the sequence cannot be close to but has to be far away at least 
{—ma}, and this fact leads to the stronger condition {ma} < l/(k + 1). □ 

The previous theorem allows us to deal with abelian repetitions in a Sturmian word 
s a by using classical results on the approximation of the irrational a by rationals. This 
is a classical topic in Number Theory. Since the number cf> — 1 has special properties 
within this topic, we have in turn specific results for the Fibonacci word. 



4 Approximating irrationals by rationals and abelian 
repetitions 

We recall some classical results of Number Theory. For any notation non explicitly 



defined in this section we refer to 22 Chap. X, XI 



The sequence F = l,Fi = 1, Fj + i = Fj + Fj-\ for j > 1 is the well known sequence 
of Fibonacci numbers. The sequence of fractions converge to <f> = ^ , while the 

sequence -J^— converges to 6 — 1 = "^T 1 . Moreover, the sequences and = §, 

F 3 , j = 0, 1, . . ., are the sequences of convergents, in the development in continued 
fractions, of <f> and <f> — 1 respectively. 

Concerning the approximation given by the above convergents, the following result 



holds (see 22 Chap. X, Theorem 171] and 22 Chap. XI, Section 11.8]) 
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Theorem 2. For any j > 0, 



F 



Fj 



Fa 



(-iy 



We also report the following theorems (see 22 Chap. XI, Theorem 193] and the 
proof of [22) Chap. XI, Theorem 194]). 

Theorem 3. Any irrational a has an infinity of approximations which satisfy 



< 



1 



Theorem 4. Suppose a = 



Vlq 2 ' 

1 . If A > the inequality 



a 



< 



Aq 2 



has only a finite number of solutions. 



The last two theorems coupled with the first part of Theorem [T] allow us to derive 
the next result. 

Theorem 5. Let s a be a Sturmian word. For any integer q > 1, let k q be the maximal 
exponent of an abelian repetition of period q in s a . Then 

k 

lim sup — > \/5 

q 

and the equality holds when a = (f> — 1. 

Proof. (Sketch) Pick an approximation p/q of a such that \p/q — a\ < 1/ \/5q 2 . Suppose 
that p/q — a < 0, the proof when p/q — a > is analogous. 

We have that qa — p < l/Vbq < 0.5. Hence, {qa} < l/Vbq and, by Theorem [I] 
there is in s a an abelian repetition of exponent VEq and period q. 

If limsupfc g /g = A > y/E then, by Theorem [l] the inequality \p/q — a\ < l/Aq 2 has 
infinitely many solutions. This is impossible by Theorem [4] □ 

We now study the abelian repetitions that are prefixes of the Fibonacci infinite word. 
For this, we will make use of the second part of Theorem [T] 

Notice that an abelian repetition of period m appearing as a prefix of the Fibonacci 
word can have a head of length equal to m — 1 at most. Therefore, we have to check all 
the abelian powers that start in position i for every i = 1, . . . , m. 

In order to do this, we report here another result (see 22 Chap. X, Theorem 182]). 

Theorem 6. Letp n /q n be the n-th convergent to a. If n > 1, < q < q n and p/q ^ 
p n /q n , then \p n - q n a\ < \p - qa\. 

We translate the statement of the theorem above in terms more convenient to our 
setting. 

Corollary 3. Suppose that m > 1 is the denominator of a convergent to a and that 
{ma} < 0.5 (resp. {ma} > 0.5). For any i, 1 < i < m, one has {ia} > {ma} 
(resp. {ia} < {ma}). 
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From the previous corollary, we have that if to > 1 is a Fibonacci number, for 
a = <f> — 1 the hypotheses of the second part of Theorem [I] are satisfied. The next 
proposition is a direct consequence of the previous corollary, Theorem [I] and Theorem 

m 

Proposition 6. Let j > 1. In the Fibonacci infinite word, the longest abelian power 
having period Fj and starting in a position i < Fj has an occurrence starting in position 
Fj and has exponent 



L^ + ^-ij-i 




1 if j is even; 

2 if j is odd. 



The following theorem provides a formula for computing the length of the longest 
abelian repetition appearing as a prefix in the Fibonacci infinite word. 

Theorem 7. Let j > 1. The longest prefix of the Fibonacci infinite word that is an 
abelian repetition of period Fj has length Fj(Fj+±+Fj—i + l) — 2 if j is even or Fj(Fj + \ + 
Fj-±) — 2 if j is odd. 

Proof. (Sketch) It is enough to prove that the length of the head and of the tail can 
be maximal (i.e., the head and the tail both have length Fj — 1) in the case of the 
maximal exponent described in Proposition [6| Indeed, by Proposition [4] and Theorem 
[6j there exists exactly one factor v that has a Parikh vector different from the Parikh 
vector Q of all the other factors, including the factor that is the prefix of / of length Fj. 
The corresponding interval of this unique factor is either Lo(a,Fj) or Lp.(a, Fj), and 
therefore it is not the interval cf> — 1 belongs to. 

Therefore, the Parikh vector of the prefix of length Fj of / is Q and the Parikh vector 
of the prefix of length Fj — 1 of / is V C Q. This proves that the head has maximal length. 
Since the exponent considered in Proposition [6] is maximal, the factor of length Fj must 
be the unique factor v with a different Parikh vector. But, by geometric considerations, 
the prefix of v of length |u| — 1 = Fj — 1 has other different factors of the same length 
that share its Parikh vector V. These other factors are prefixes of at least one factor of 
length Fj that has Parikh vector Q. Therefore, V C Q. This proves that also the tail 
has maximal length. □ 

Corollary 4. Let j > 1 and let kj be the maximal exponent of a prefix of the Fibonacci 
word that is an abelian repetition of period Fj . Then 

li m k = ^5. 

j-s-oo Fj 

Proof. We proved in a previous theorem that over a bigger set of repetitions the superior 
limit tends to Therefore, in this case, it must be smaller than or equal to y/E. The 
equality follows from the fact that F: >+ 1 ^ Fj - 1 converges to \fh. Since the sequence 

F ■ 

converges to <j) and the sequence converges to (j> — 1, then the sequence 



F 3 + l + F 3-l _ F 3 + l , F 3 

converges to 



F- F F 

r 3 r 3 r 3 



\/5 + l 



V5. 



□ 
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f„\ abaa6abaa6aabafiaaba6aabaaba^ ■ ■ ■ 
(c) I I I I I I I 

5 10 15 20 25 30 35 40 45 50 55 60 65 

(l\ aBaaBabaabiiaba6aa6aba ■ ■ ■ 
(b) | | | 

5 10 15 20 25 

t„\ a6aaBa5aabaab ■ ■ ■ 
( a ) | i n | 

5 10 15 



Fig. 5. Longest abelian repetition of period m that is a prefix of the Fibonacci word for m — 
2, 3, 5. (a) For m = 2, the longest abelian repetition has length 8 = 1 + 3p + 1. (6) For m — 3, 
the longest abelian repetition has length 19 = 2 + 5p + 2. (c) For m — 5, the longest abelian 
repetition has length 58 = 4 + lOp + 4. 



3 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


Fj 


2 


3 


5 


8 


13 


21 


34 


55 


89 


144 




8 


19 


58 


142 


388 


985 


2616 


6763 


17798 


46366 


iVB-kj/Fjl * 10 2 


23.6 


12.5 


8.393 


1.732 


5.98 


0.25 


2.69 


0.037 


1.087 


0.005 



Table 1. The length of the longest prefix (lp(Fj)) of the Fibonacci word having abelian period 
Fj for j = 2, . . . , 11. The table also reports rounded distances (multiplied by 10 2 ) between \/5 
and the ratio between the exponent kjlp(Fj)/ Fj of the longest prefix of the Fibonacci word 
having abelian period Fj and Fj (see Corollary [4| . 



In Fig. [5j we give a graphical representation of the longest prefix of the Fibonacci 
infinite word that is an abelian repetition of period m for m = 2, 3 and 5. 

In Table [TJ we give the length lp(Fj ) of the longest prefix of the Fibonacci infinite 
word that is an abelian repetition of period Fj, for j = 2, . . . , 11, computed using the 
formula of Theorem [7) We also show the values of the distance between \fb and the ratio 
between the maximal exponent of a prefix of the Fibonacci infinite word having abelian 
period Fj and Fj. 

Recall that the Fibonacci (finite) words are defined by fo = b, /1 = a, and for every 
j > 1, fj+i = fjfj-v So, for every j, one has \fj\ = Fj. 

As a consequence of the formula given in Theorem [7J we have the following result on 
the smallest abelian periods of the Fibonacci words. 

Theorem 8. For j > 3 ; the (smallest) abelian period of the word fj is the n-th Fibonacci 
number F n , where n = |j/2j if j — 0, 1, 2 mod 4, or n — 1 + L//2J if j — 3 mod 4. 

Proof. (Sketch) It is possible to prove that no abelian repetition that is a prefix of the 
Fibonacci infinite word of period m, with F n < m < F n +i, n > 2, has length greater 
than the length of the abelian repetition that is a prefix of the Fibonacci infinite word 
of period F n . Indeed, if m is such that F n < m < F n+ i, n > 2, one has that {m(4> — 1)} 
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is not too close to an integer and then it is possible to use the same technique developed 
in the proof of Theorem [I] By using Proposition [7J it is sufficient to find the smallest 
index n such that \fj\ — Fj is greater than the length of the longest abelian repetition in 
the Fibonacci infinite word of period F n -\. A computation that makes use of Theorem 
[2] gives the desired formula. □ 

For example, the abelian period f± = abaab is 2 = F2 = |_4/2J , since one can write 
fi = a • ba • ab; the abelian period of fa = abaababa is 2 = F 2 ; the abelian period of 
fe = abaababaabaab is 3 = F 3 ; the abelian period of f 7 = abaababaabaababaababa is 

5 = F 4 . 

In Table [2] we report the abelian periods of the first Fibonacci words. 

j 3 4 5 6 7 8 9 10 11 12 13 14 15 16 



a. p. of fj F 2 F 2 F 2 F 3 F 4 F 4 F 4 F 5 F 6 F 6 F 6 F 7 F s F 8 



Table 2. The (smallest) abelian periods of the Fibonacci words fj for j = 3, . . . , 16. 
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